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Description 

FIELD OF THE INVENTION 

5 The present invention relates to a novel protein and a DNA coding for the same. More particularly, it relates to a 

novel protein having inhibitory activity on the protease activity of hepatocyte growth factor activating factor (HGF acti- 
vator) (hereinafter this protein in sometimes referred to also as "HAI-II'*), a gene coding for the protein, an expression 
vector containing the gene, a transformant as transformed with the expression vector, and a method of producing HAI- 
II using the transformant. 

10 

BACKGROUND OF THE INVENTION 

It is already reported that thrombin activates the precursor of HGF activator (JP-A-5- 103670. JP-A-6-141859, JP- 
A-6-1 53946 and JP-A-6-1 53966 (the term -JP-A" as used herein means an "unexamined putrfished Japanese patent 

IS application")); factor having activity to convert the single chain form of hepatocyte growth factor (HGF) to its double 
chain form) in the manner of positive activity control. However, any human tissues-derived protease inhibitor capable of 
inhibiting, as a negative control factor, the physiological activity of HGF activator has not been l^nown. Therefore, how 
HGF activator is controlled in human tissues remains unknown. Such a negative control factor might also influence indi- 
rectly on the activity of hepatocyte growth factor (HGF) on which HGF activator acts. Thus, for the analysis of the mech- 

so anism of action of HGF in vivo, as well, it has been demanded that such a human tissues-derived protease inhibitor be 
isolated and identified. 

By using such a protease inhibitor and an antibody to the protease inhibitor, it would become possible to know the 
in vivo physiological activity of HGF activator, analyze the mechanism of action thereof or analyze the mechanism of 
control of HGF activation, from a standpoint different from those of the prior art. 

25 Furthermore, for investigating the detailed in viyo function of HAI-il or the effect of HAI-II in hepatic disorder, for 
instance. HAI-II is required in large quantities. At present, however, there is only one method available for preparing 
HAI-II, which method comprises using, as a starting material, the culture supernatant obtained with a human cancer 
cell line such as MKN45 or A549 cells and purifying therefrom HAI-II occuning therein in trace amounts. This method 
is not always the best one from the labor, time and cost viewpoint. It encounters great difficulties in stably isolating the 

30 minor amount of HAI-II alone. Therefore, it has been desired that an expression system be constructed so that HAI-II 
can be obtained stably and in large quantities. 

SUMMARY OF THE INVENTION 

35 The present inventors have conducted screening of various cultured cell lines using, as an indicator, the inhibitory 
activity on the protease activity of hepatocyte growth factor activator and have found that a substance having the activity 
occurs in the culture supernatant of certain human cancer cell lines (MKN45 cells, A549 cells and like epithelial tumor 
cell lines). To reveal the nature of its inhibitory activity, they further attempted to purify the substance from the MKN45 
cell culture supernatant using various column chron^tography techniques. As a result, they have found a novel protein 

40 with a molecular weight of about 30.000 daltons as determined by SDS (sodium dedecyl sulfate)iDolyacrylamide gel 
electrophoresis(PAGE) and. they also have obtained an amino-terminal amino acid sequence of this protein by analyz- 
ing the protein on a protein sequencer. Further, they determined partial amino acid sequences by decomposing the pro- 
tein using proteolytic enzymes, isolating the resultant peptides and subjecting each peptide to the same amino acid 
sequence analysis as mentioned above. Furthermore, they estimated DNA base sequences based on the partial amino 

45 acid sequences and conducted screening of a cDNA library using oligonucleotide probes prepared based on the 
sequences. As a result, they have succeeded in cloning a gene coding for the protein and have now completed the 
present invention. 

Furthermore, as a result of various investigations to produce the protein stably and in large quantities using the 
recombinant DNA technique and, the present inventors have constructed a novel expression vector coding for the pro- 

50 tein and have enabled expression of the protein. Thus, by constructing a plasmid for protein expression by inserting a 
DNA fragment coding for part or the whole of the amino acid sequence of the protein into a plasmid vector such as the 
expression vector pMEISS for use in animal cells or an expression vector for use in yeasts, Escherichia COii and the 
like, at a site downstream from the promoter thereof and using the thus-obtained recombinant plasmid to transform host 
cells, they have now completed the present invention in another aspect. 

55 The present invention thus relates to a protein having the following physico-chemical properties: 

(1) a molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis: 

(2) inhibitory activity on the protease activity of hepatocyte growth factor activator; and 

(3) one of the amino add sequences depicted in the sequence listing under SEQ ID NO:1 through 3 or an amino 
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acid sequence substantially equivalent thereto: proteins respectively having the amino acid sequences depicted in 
the sequence listing under SEQ ID NO:1 through 3 or amino add sequences substantially equivalent thereto and 
having inhibitory activity on the protease activity of hepatocyte growth fector activator; a protein having the amino 
acid sequence depicted in the sequence listing under SEQ ID N0:4 or on amino acid sequence substantially equiv- 

5 alent thereto; and a protein having, as its amino acid sequence, that segment of the amino acid sequence depicted 

in the sequence listing under SEQ ID NO:4 which starts with the 28th amino acid (alanine) residue and ends with 
the 252nd amino add (leudne] residue, or an amino acid sequence substantially equivalent thereto: DNAs and 
genes coding for the proteins defined above; expression vectors respectively containing the DNA or genes; 
transfbrnrants obtained by transformation of host cells with the expression vectors: as well as a method of produc- 

10 ing proteins having inhibitory activity on the protease activity of hepatocyte growth factor activator which comprises 
cultivating the transformants. 

The base sequence shown in the sequence listing under SEQ ID NO:4 contains only one strand, with the other 
complementary base sequence being omitted. Starting with this gone and using the recombinartt DNA technology, it is 

IS possible to cause expression of, for example, the protein having the amino add sequence shown in the sequence listing 
under SEQ ID N0:4. On that occasion, the protein translated from mRNA coding for the protein contains a signal 
sequence. After extracellular excretion, however, the signal sequence has been deaved off and the protein obtained 
has an amino acid sequence comprising the 28th amino acid (alanine) residue end the subsequent amino acid residues 
of the amino acid sequence shown in the sequence listing under SEQ ID NO:4. Signal sequences of other proteins may 

20 also be used as the signal sequence. For signal sequence-free mature protein expression in host cells, a gene having 
that portion of the base sequence shown in the sequence listing under SEQ ID NO:4 which comprises the 82nd nude- 
otide (guanine) residue and the sut>sequent nudeotide residues may be used as the gene coding for the relevant pro- 
tein and joined to the ATG codon of a vector. The present invention further includes, within the scope thereof, 
modifications of the proteins or DNAs mentioned above as derived therefrom by deletion, substitution and/or addition 

25 of one or more amino acid or nucleotide residues within limits not harmful to the inhibitory activity on the protease activ- 
ity of HGF activator, namely those proteins or DNAs that respectively have "substantially equivalent amino acid 
sequences'* or "substantially equivalent base sequences". 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 

Fig, 1 shows the results of assaying of the protein of the present application for its inhibitory activity on the protease 
activity of HGF activator. 

Fig. 2 shows the structure of the plasmid pME18S-HAI-ll. 

35 DETAILED DESCRIPTION OF THE INVENTION 

In tiie following, the present invention is described in further detail. The novel protein of the present invention which 
has protease inhibitor activity can be obtained by proceeding via such purification steps as mentioned below. For exam- 
ple, a human cancer cell line (MKN45 cells or A549 cells deposited with the Japanese Cancer Research Resources 

40 Bank under tiie deposite numbers JCRB0254 and JCRB0076. respectively, or like epitherlial tumor cell line) is culti- 
vated in a serum-free medium for several days, the culture supernatant is recovered and, after removal of cells there- 
from and concentration, submitted to a heparin-Sepharose column (available e.g. from Pharmada). The non-adsorbed 
fraction is submitted to a ConA-Sepharose column (available e.g. from Pharmacia) and separated into an adsorbed 
fraction and an non-adsorbed fraction. The non-adsorbed fraction is subjected to hydrophobic chromatography using 

45 Phenyl-5PW (available e.g. from Tosoh Corp.). The thus-obtained fraction containing tiie desired protein is chromato- 
graphed on a DEAE ion exchange column (available e.g. from Polymer Laboratory), then submitted to a hydroxyapatite 
column (available e.g. from Mitsui Toatsu Chemicals or Seikagaku Corp.), and further to gel filtration column chroma- 
tography (using e.g. Asahi Chemical Industry's GS520) to give tine protein in question. The purification steps may fur- 
ther include reversed phase column chromatography and/or other appropriate means, as necessary. 

so Upon SDS-polyacrylamide gel electrophoresis, the thus-purified protein of the present invention migrates as a 
smear band or several fragments presumably resulting from differences in sugar chain, amino acid residue modification 
and/or C-terminal side mutation and having a molecular weight of about 30,000 daltons. When reacted with HGF acti- 
vator, the protein shows inhibitory activity on the protease activity of HGF activator. This protein of the present invention 
contains the amino acid sequence shown in Table 1 below. 

ss A DNA fragment of the gene coding for the novel protein of the present invention can be obtained in the following 
manner. By analyzing the novel protein purified in the above manner using a gaseous phase protein sequencer (avail- 
able e.g. from Applied Biosystems). its amino-termtnal amino add sequence can be determined- Further, the protein is 
decomposed using lysyl endopeptidase (e.g. Achromobacter protease I), the resulting peptide fragments are separated 
by reversed phas high-performanc liquid chromatography (using e.g. a YMC's column) and each fragment is sub- 
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jected to amino acid sequence analysis in the same manner as mentioned above, whereby the amino acid sequence 
of an intermediate portion of the protein can be revealed. 

A DNA base sequenc is deduced from the amino acid sequence thus determined and. correspondingly, appropri- 
ate oligonucleotides are synthesized and used as probes. Human-derived liver, spleen and placenta cDNA libraries 

5 (available e.g. from Clonetec), among others, can be used as the cDNA library for screening out the gene coding for the 
desired protein. In addition, a cDNA library may be constructed in the conventional manner from & cell line or tissue 
material in which the protein is expressed. 

Escherichia coll is transfected with X phage containing such cDNA incorporated therein (the method of Maniatis et 
al.: "Molecular Cloning") and then cultivated. The plaques formed are subjected to selection by plaque hybridization 

10 using oligonucleotide probes prepared based on the t>ase sequence deduced from the amino acid sequence of a por- 
tion of the protein in question, whereby a certain number of different X phage clones having the amino acid sequence 
of the desired protein and containing, in addition, those segment base sequences of the protein that correspond to 
other regions than the probes can be otrtained with ease. 

Then, the phage from each positive plaque obtained in the above screening is then allowed to replicate by the 

15 method of Maniatis et aJ., and DNA is purified therefrom by the glycerol gradient method and, after appropriate restric- 
tion enzyme cleavage, submitted to cDNA subcloning into a plasmid vector such as pUC18 or pUC19 or a single chain 
phage vector such as M13mp18 or M13mp19. Thereafter, the base sequence of the desired cDNA fragment can be 
determined by the method of Sanger et al. The base sequences of the clones obtained are analyzed and synthesized 
and, as a result, a gene totally corresponding to the whole amino acid sequence of the desired protein as shown in the 

20 sequence listing under SEQ ID N0:4 can be derived from a group of cDNAs coding for respective portions of the pro- 
tein. It is also possible to obtain a gene containing the whole of the cDNA in question, a gene containing the cDNA with 
deletion of a partial base sequence thereof, a gene containing the cDNA with insertion of some other base sequence, 
a gene containing the cDNA with substitution of some other base sequence for a partial base sequence of the cDNA. 
or the like gene from a variety of cDNA libraries by the PGR method using portions of the cDNA in question as probes. 

25 Such site-specif ic mutation, inclusive of base sequence deletion, addition or substitution, can be readily realized by the 
methods described in the literature (e.g. Methods in Enzymol.. 217. 218 (1993); ibjd., 217. 270 (1993)). 

The group of cDNAs otrtained in the alxwe manner are joined together so that the order of the base sequences is 
fit to the amino acid sequence of the protein, to give a DNA fragment covering the whole region of the protein. The DNA 
fragment is inserted into a plasmid. such as pCDL-SRa296, at a site downstream from the promoter thereof and 

30 matched in phase with the translation initiation codon ATG, to thereby construct a protein expression vector. Then, the 
protein can be expressed in a host, for example animal cells, transformed with the plasmid. Thereafter, the protein 
expressed can be recovered by purification by a conventional method. 

Thus, each of the thus-obtained cDNAs is inserted into a plasmid, such as pMEISS. at a site downstream from the 
promoter thereof to thereby Construct a plasmid for protein expression. The protein or a protein derived therefrom by 

35 partial amino acid sequence deletion, insertion or substitution can be expressed in a host, such as animal cells, trans- 
formed with the expression plasmid. More concretely, CHO cells. COS cells, mouse L cells, mouse C127 cells, mouse 
FM3A cells and the like can be used as the animal cells for protein expression. When these animal cells are used as 
the host, the use, as a signal sequence, of that portion of the DNA base sequence shown in the sequence listing under 
SEQ ID NO:4. namely the gene for the protein, which starts with the 1st nucleotide and ends with the 35th nucleotide. 

40 or the use of an existing signal sequence is expected to be conducive to extracellular secretory production of the protein 
or production thereof on the cell membrane. 

The expression plasmid for use in animal cell hosts is constructed in the following manner. As the promoter, use 
can be made of any of the existing promoters, for example the SRa promoter, SV40 promoter or metallothionein gene 
promoter. A DNA containing the whole gene for the protein, inclusive of the above-mentioned signal-like sequence, a 

45 DNA containing the gene with a partial base sequence deletion, a DNA containing the gene with insertion of a base 
sequence or a DNA containing the gene with substitution of some other sequence for a partial base sequence thereof 
is inserted into a site downstream from the promoter in the direction of transcription. In constructing the expression 
plasmid for the protein, two or three pieces of the DNA fragment of the gene coding for the protein may be joined 
together and used for insertion downstream from the promoter. It is also possible to join such a promoter as the SV40 

50 promoter to the 5' upstream side of the DNA fragment of the gene coding for the protein to give a unit insert and insert, 
into a vector, two or three such units joined together in the same direction of transcription, A polyadenylation site is 
added to the downstream side of the gene coding for the protein. For example, the polyadenylation site derived from 
the SV40 DNA, p-globin gene or metallothionein gene can be joined to the downstream side of the gene coding for the 
protein. When a DNA fragment comprising a pronrrater and the gene coding for the protein as joined together is dupli- 

55 cated or triplicated, each unit may contain a polyadenylation site on the 3' side of the gene coding for the protein. In 
transforming animal cells, for exanple CHO cells, with such expression vector, a drug resistance gene can be used for 
the purpose of expression cell selection. As the drug resistance gene, there may be mentioned the DHFR gene which 
provides resistance to methotrexate (J. Mol. Biol., 159. 601 (1982)). the Neo gene which provides resistance to the anti- 
biotic G-418 (J. Mol. Appl. Genet,. 1, 327 (1982)), the Escherichia coli<lerived Ecogpt gene which provides resistance 
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to mycophenolic add (Proc. Natl. Acad. Sci. U.S.A., 7S. 2072 (1981)) and the hph gene which provides resistance to 
the antibiotic hygromydn (Mol. Cell. Biol., 5. 410 (1985)), among others. Each resistance gene contains a promoter, 
such as the above-mentioned SV40-derived promoter, inserted on the 5' upstream aide and a polyadenylation site as 
mentioned abov on the 3' downstream side of each resistance gene. In inserting such r sistance gene into the expres- 
sion vector for the protein, the gene may be inserted at a site downstream from the polyadenylation site of the gene cod- 
ing for the protein, in either direction, the same or opposite. TTiese expression vectors make it unnecessary to perform 
double transffornrration with another plasmid Containing a selective marker gene for the purpose of transformant isola- 
tion. When the expression vector for the protein does not contain such a selective marker gene insert, a vector having 
a marker suited for transformant selection, for example pSV2neo (J. Mol. Appl. Genet., 1 327 (1982)). pl^BG (Nature. 
294. 228 (1981)), pSV2gpt (Proc. Natl. Acad. Sd. U.S.A.. 28. 2072 (1981)) or pAd-D26- 1 (J. Mol. Biol.. 159, 601 
(1982)), may be used, in combination with the expression vector for the gene coding for the protein, for transformation 
to thereby make it easy to perform transformant selection based on phenotypic expression of the drug resistance gene. 

The expression vector can be introduced into animal cells by the caldum phosphate method (Virology. 52. 456 
(1973)) or the electroporation method (J. Menribr. Biol., HL 279 (1972)), for instance. The transformed animal cells can 
be cultivated in the conventional rnanner in the manner of suspension culture or adhesion culture. They are cultivated 
in a medium such as MEM or RPMI 1640 in the presence of 5 to 10% of serum or in the presence of an appropriate 
amount of insulin, transfen-in or the like, or under serum-free conditions. Further, rt is also possible to produce the pro- 
tein using microorganisms such as yeants or Escherichia coli. for example strains of Saccharomvces cerevisiae or the 
strain Escherichia coli YA-21 . since the cells express the protein in tiie culture supernatant or on the cell surface, it is 
possible to recover and purify tiie protein using the culture supernatant or cells of this transformant. More specifically, 
the protein can be isolated and purified by subjecting the culture supernatant or cell extract containing the protein pro- 
duced to an appropriate chromatography procedure, for example chromatographic treatment using heparin-Sepharose. 
ConA-Sepharose, hydroxyapatite and the like in combination. 

The protease inhibitor activity-eridowed protein of the present invention has inhibitory activity on the protease activ- 
ity of HGF activator and. therefore, is useful as a in vitro or in vivo regulatory factor for HGF activator or, indirectly, as a 
HGF activity regulating factor. The protein as well as an antibody to the protein or a gene coding for the protein is further 
useful as a tool or means for function analysis of the factors. 

Furthermore, by introducing an expression vector can-ying a gone coding for the protein into animal cells, it 
becomes possible to produce part or the whole of the protein or a protein equivalent tfiereto, which is biologically active, 
in a stable manner and in large quantities. This has so far been difficult to attain. 

The present invention is now illustrated in greater detail with reference to the following Examples. However, it is not 
intended that the present invention be limited to these Examples. 

EXAMPLE 1 

(Purification of the protein using an MKN45 cell culture supernatant) 

MKN45 cells (Naito et al., Gan to Kagaku-ryoho (Cancer and Chemotfierapy), S. 89 (1978)) (obtained from Meneki 
Seibutsu Kenkyusho) were seeded into eRDF medium containing 5% FBS (fetal bovine serum) as placed in a roller bot- 
tle 850 and allowed to multiplicate until a confluent state was attained. Then, the FBS-containing culture supernatants 
was removed and the cells were washed witii two portions of serum-free eRDF medium. After removing the washing 
medium. 500 ml of serum-free eRDF medium was added and incutation was can-ied out at 37*C for 3 to 6 days. After 
incubation, tiie culture supernatants was recovered, 500 ml of fresh serum-free eRDF medium was added, and incuba- 
tion was again conducted. This procedure was repeated several times. The culture supernatants thus recovered were 
combined and concentrated about 20-fbld using a YM30 ultrafiltration membrane (Amicon). 

This concentrate was submitted to a heparin-Sepharose column (equilibrated with PBS) and the non-adsorbed 
fraction was recovered. This fraction was submitted to a ConA-Sepharose column (equilibrated with PBS) and sepa- 
rated into the non-adsorbed fraction and an adsorbed fraction eluted with a PBS solution containing 200 mM a-methyl- 
D-mannoside. The ConA non-adsorbed fraction was concentrated using YM30. followed by buffer substitiJtion to 10 mM 
phosphate buffer (pH 6.8) containing 1 M ammonium sulfate. The new solution was subjected to HPLC using Phenyl- 
5PW (Tosoh Corp.: equilibrated with 10 mM phosphate buffer (pH 6.8) containing 1 M ammonium sulfate), followed by 
linear concentration gradient elution with 1 M ammonium sulfate to 0 M ammonium sulfate. A fraction containing the 
desired protease inhibitor activity was thus recovered. 

The fraction was dialysed against 20 mM Tris-hydrochloride buffer (pH 8) containing 0.05% CHAPS and tinen sufc>- 
jected to HPLC using DEAE (equilibrated with 20 mM Tris-hydrochloride buffer (pH 8) containing 0.05% CHAPS), fol- 
lowed by linear concentration gradient elution with 0 M to 500 mM NaCI, whereby a fraction showing the desired 
protease inhibitor activity was recovered. The fraction was dialyzed against 5 mM phosphate txjffer (pH 6.8) containing 
0.05% CHAPS and then subjected to HPLC using a HCA A-4007 column (product of Mitsui Toatsu Chemicals) (equili- 
brated with 5 mM phosphate buffer (pH 6.8) containing 0.05% CHAPS), and the non-adsorbed fraction was recovered. 
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The fraction was submrtted to GS-520 (equilibrated with PBS containing 0.05% CHAPS) and an activ fraction (fraction 
of about 40 to 20 kDa) was recovered. For eliminating minor bands, the fraction was applied to a YMC pack C4 column 
(obtained from YMC), linear concentration gradient elution was carried out over 40 minutes using acetonitrile-isopropyl 
alcohol (3/7) containing 0.1% TFA and varying the concentration thereof from 10% to 50%, and the active fraction was 
neutralized with 1 M Tris-hydrochloride buffer (pH 8) and then dried under reduced pressure. After drying, the solid 
obtained was dissolved in PBS containing 0.05% CHAPS to give a purified protein solution. 

EXAMPLE 2 

(Amino-terminal amino add sequence and partial amino acid sequence determination of the protein) 

The protein having protease inhibrtor activity as purified as in Exanrple 1 and eluted by reversed phase HPLC was 
dried under reduced pressure without neutralization, "mis was dissolved in 60 jal of 50% TFA (trifluoroacetic acid), 
added to a polybrene-treated glass filter and subjected to Edman degradation on an Applied Biosystems model 470 A 
sequencer, and the amino add sequence of an N-terminal region was determined. Phenylhydantoin (PTH)-amino acids 
were identified using a Mitsubishi Chemical's MCI gel CDS IHU column (0.46 x 15 cm) and conducting single solvent 
elution with acetate buffer (10 mM acetate buffer (pH 4.7), 0.01% SDS. 38% acetonitrile) at aflow rate of 1 .2 ml/minute 
and a temperature of 43**C. PTH-amino acids were detected based on the absoibance at 269 nm. 

As a result, the N-terminal amino acid sequence shown below in Table 1 was identified. 

Then, the same protein having protease inhibitor activity as purified as in Example 1 and eluted by reversed phase 
HPLC was dissolved in 100 ^l of 50 mM Tris-hydrochloride buffer (pH 9.0) containing 4 M urea, lysyl endopeptidase 
(Achromobacter protease I) was added to the solution, and the reaction was carried out at 37°C for 8 hours. The result- 
ing peptide mixture was separated by reversed phase HPLC using a Yf^C pack C8 column (YMC) to give respective 
peptide fragments. Two peptides were subjected to amino acid analysis using a gaseous phase sequencer (Applied 
Biosystems model 1470A). The sequences shown in Table 1 were found. 

TABLE 1 

Amino acid sequences of peptides 

N-terminal: Ala-Asp-Arg-Glu-Arg-Ser-lle-His-Asp-Phe-Xaa-Leu-Val-Ser-Lys (SEQ ID NO:1 in 
the sequence listing) 

Partial amino add sequences 

1 : Lys-Val-Val-Gly-Arg-Xaa-Arg-Ala-Ser-Met-Pro-Arg-Trp-Trp-Tyr-Asn-Val- Thr-Asp-Gly-Ser- 
Xaa-GIn-Leu-Phe-Val-Tyr-Gly-Gly (SEQ ID NO:2 in the sequence listing) 
2: Ala-Thr-Val-Thr-Glu-Asn-Ala-Thr-Gly-Asp-Leu-Ala-Thr-Ser-Arg-Asn-Ala- Ala-Asp-Ser-Ser- 
Val-Pro-Ser-Ala-Pro (SEQ ID NO: 3 in the sequence listing) 

(Xaa: amino acid residue not yet identified) 



EXAMPLE 3 

(Purification of the protein using an A549 cell culture supernatant and amino acid sequence analysis) 

A culture supernatant was prepared by cultivating A549 cells (obtained from the Japanese Cancer Research 
Resources Banl^ in the same manner as in Example 1 . Using the culture supernatant and proceeding in the same man- 
ner as in Example 1 . a protein having the inhibitory activity on the protease activity of HGF activator was obtained. Upon 
SDS-PAGE, this protein showed the same molecular weight as that derived from MKN45 cells. When subjected to the 
same N-terminal amino acid sequence determination as in Example 1. this protein gave the same sequence as that of 
the MKN45 cell-derived protein. This suggested the possibility of the protein being identical with the MKN45-derived 
protein. 
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EXAMPLE 4 

(Method of assaying the activity of the protein inhibiting the protease activity of HGF activator as well as the activity) 

5 One to ten ^l of the sample to be assayed was added to 30 to 40 >J of PBS-0.05% CHAPS solution containing 2 to 

5 ng of serum-derived HGF activator. After 30 minutes of incubation at 37**C, 5 to 1 0 ng of single chain HGF was added 
and incubation was further continued for 2 hours. This incubation mixture was subjected to SDS-polyacrylamide gel 
electrophoresis under reducing conditions. After electrophoresis. Coomassie Brilliant Blue R250 (CBB) staining was 
performed and the proportions of single chain HGF and double chain HGF were compared for activity detection. 

10 The purified protein (10 ng) and 5 ng of serum-derived HGF activator were incubated in 30 to 40 nl of PBS-0.05% 
CHAPS solution at 37**C for 30 minutes, then 10 ng of single chain HGF was added, and incubation was further contin- 
ued for 2 hours. The incubation mixture was subjected to SDS-polyacrylamide gel electrophoresis under reducing con- 
ditions followed by staining with CBB. The results are shown in Rg. 1. In the figure, the numeral 1 indicates the case 
where neither of HGF activator and the protein was added. 2 indicates the case where HGF activator was added but 

IS the protein was not added, and 3 Indicates the case where HGF activator and the protein were added. Addition of the 
protein resulted in suppression of tiie activity of HGF activator converting single chain HGF to double chain HGF 

EXAMPLE 5 

20 (SDS-polyacrylamide gel electrophoresis) 

For determining the sipparent molecular weight of the protein having protease inhibitor activity as purified from the 
MKN45 cell cufture supernatant or A549 cell culture supernatant in Example 1 or Example 2. the protein was subjected 
to SDS-polyacrylamide gel electrophoresis. The protein finally purified was subjected to SDSisolyacrylamide gel elec- 

25 trophoresis using 12.5% polyacrylamide slab gels, which was conducted under nonreducing conditions. The molecular 
weight markers used were Molecular weight markers "Daiichi" III for Laemmli method (Daiichi Pure Chemicals). After 
electirophoresis. color development was performed using a silver stain reagent (Kanto Chemical). Upon relative com- 
parison in migration distance between the protein and the molecular weight markers, the protein obtained from the 
MKN45 cell culture supernatant or A549 cell culture supernatant showed several fragments or a smear band, presum- 

30 ably due to differences in sugar chain, amino acid residue modification or terminal region, at positions around an appar- 
ent molecular weight of about 30.000 daltons as determined by SDS-polyacrylamide gel electrophoresis. 

EXAMPLE 6 

35 (Cloning of a gene coding for the protein and base sequence determination) 

The following two oligonucleotide primers were designed based on a presumption that Xaa will be Cys from the 
standpoint of tfie sequences Lys-Val-Val-Gly-Arg-Xaa-Arg and Xaa-Gln-Leu-Phe-Val-Tyr-Gly-Gly selected from the par- 
tial amino acid sequence (SEQ ID N0:2 in the sequence listing) of the protein obtained in Example 2. 

40 

Primer 1 : 5'-AAGGTNGTNGGNMGNTGYMG-3' (SEQ ID NO: 5 in the sequence listing); and 
Primer 2: 5'-CNCCGTANACGAANARYTGRC-3' (SEQ ID NO: 6 in the sequence listing). 

(In the above sequences, N indicates T or G, M indicates A or C. Y indicates T or C, and R indicates A or G.) 

45 Then, total RNA was prepared from MKN45 cells by tiie method described in Anal. Biochem.. 162. 156 (1987) and 
applied to on oligo(dT)-cellulose column, whereby poly(A)+RNA was obtained. 

Using the tiius-obtained poly(A)+RNA as the template, RT-PCR (reverse transcription-polymerse chain reaction; cf. 
K. Hayashi (ed.): "PCR Ho no Saishin Gijutu (State-of-art Techniques of PGR)", page 44 and page 52. published Feb. 
5. 1995 by Yohdosha) was carried out. The reaction mixbjre obtained by tfiis RT-PCR was analyzed by polyacrylamide 

50 gel electrophoresis, whereupon a DNA fragment of about 85 bp was detected. Therefore, this DNA fragment was 
extieicted from the polyacrylamide gel. followed by phenol -chloroform extraction and ethanol precipitation, whereby the 
DNA fragment was recovered. The base sequence of the DNA fragment was determined by the dideoxy method. Fur- 
ther, tills DNA fragment was labeled with ^^P by the method described in "Molecular Cloning" (Cold Spring Harbor Lab- 
oratory. 1982) and used as a screening probe. 

55 Using the MKN45 cell line-derived poly(A)+RNA together with a cDNA synthesis kit (Pharmacia). cDNA was syn- 
thesized, and a phage library was constructed, as a litwary for screening, with X ZAPII (Strategene) as the vector. 
Escherichia coli XL Blue (Strategene) was infected with the above phage library to give about 100,000 plaques. 

After overnight culture in NZY medium, the bacterial cells were transferred to a Gene Screening Plus membrane 
(du pont). The membrane was placed on a filter paper impregnated with 0.1 M sodium hydroxide-0.5 M Tris hydrochlo- 
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ride buffer (pH 7.5) and allowed to stand for 2 minutes and then placed on a filter paper impregnated with 1 .5 M sodium 
chloride-0.5 M Tris hydrochloride buffer (pH 7.5) and allowed to stand for 5 minutes. After two more repetitions of this 
series of treatments, the membrane was washed with 2 x SSC (two-fold concentrated SSC) and air-dried on a dry filter 
paper. This membrane was irradiated with UV light at a dose of 120 mJ/cm^ for fixation of the DNAs transferred to the 

s membrane. The thus-treated membrane was immersed in 50 ml of a solution comprising 50 mM Tris hydrochloride 
buffer (pH 7.5). 1 M sodium chloride and 1% SDS and maintained in that state at 65**C for 2 hours. Then, the membrane 
was immersed in 40 ml of a solution comprising 5 ng/ml of the above-mentioned ^P-labeled probe, 100 ^g/ml of 
salmon sperm. 50 mM Tris hydrochloride buffer (pH 7.5), 1 M sodium chloride and 1% SDS and maintained in that state 
at 65*'C for 16 hours. Thereafter, this membrane was washed with 2 x SSC at room temperature over 5 minutes and 

10 then with two portions of 0. 1 x SSC at room temperature over 30 minutes, and subjected to autoradiography, which gave 
22 positive clones supposedly containing cDNA for the protein. Then, each positive clone was submitted to plasmid 
construction by the excision method (Toyobo Ufe Science Catalog, pages 114-115) according to Toyotx) Life Science 
Catalog (pages 114-115) and Strategene's manual. 

The thus obtained plasmid DNAs were then cleaved with the restriction enzyme Eco RI and subjected to agarose 

IS gel electrophoresis, and a clone with the longest cDNA for the protein inserted therein was selected. Then, by analyzing 
the base sequence of the plasmid (pHAI-ll) hartxxed by this done, the whole base sequence of the gene coding for the 
protein was determined (SEQ ID NO:4 in the sequence listing). 

EXAMPLE 7 

20 

(Preparation of an expression plasmid for the protein) 

A 10-ng portion of the plasmid (pHAl-lI) containing the cDNA for the protein as obtained in Example 6 was sub- 
jected to cleavage with the restriction enzyme Eco RI. followed by agarose gel electrophoresis. Thus, an about 1.4 kb 

25 Eco RI- Eco RI DNA fragment containing the cDNA for the protein was separated and extracted. The thus-obtained DNA 
fragment wag rendered Wunt-ended by the conventional method using T4 DNA polymerase and. then, the DNA frag- 
ment was purified by phenol-chloroform extraction and ethanol precipitation and dissolved in 10 nl of water. Separately. 
0.05 ng of the expression vector pME18S (Medical Immunology. 2Q. 27 (1990)) was cleaved in advance with the restric- 
tion enzyme Xho l. and the DNA fragment obtained was rendered blunt-ended by the conventional method using T4 

30 DNA polymerase and then purified by phenol-chloroform extraction and ethanol precipitation. This was dissolved in 400 
jil of a 1 mM MgCl2 solution in 50 mM Tris-HCI (pH 8), 1 unit of bacterial alkaline phosphatase (Toyobo. BAP-101) was 
added, and dephosphorylation treatment wag conducted at SS^'C for 30 minutes. Then, the DNA fragment was purified 
from this reaction mixture by phenol-chloroform extraction and ethanol precipitation and dissolved in 10 of water. 
Ligation reaction was earned out in 20 ^l of a reaction mixture (66 mM Tris-HCI. pH 7.6, 6.6 mM MgCl2. 10 mM dithio- 

35 threitd. 66 ^M ATP) containing 0.01 ^g of tiie pME18S vector-derived DNA fragment prepared as mentioned above and 
0.1 ^g of the above-mentioned blunt-ended EcoRI fragment of cDNA for the protein in the presence of T4 DNA ligase 
(Toyobo LGA-101) at ^A^C for 12 hours. A 10-jJ portion of this T4 DNA ligase reaction mixture was used to transform 
Escherichia coli HB101 (Takara Shuzo) according to the manual attached thereto. The microorganism was cultured on 
a medium containing 50 ^g/ml of ampicillin and scores of ampicill in-resistant strains were obtained. These transform- 

40 ants were analyzed by the method of Maniatis et al, ("Molecular Cloning". Cold Spring Harbor Laboratory, pages 86-96 
(1982)) and, as a result, a plasmid. pME18S-HAI-ll. containing the gene coding for the protein as inserted at the Xho l 
restriction enzyme cleavage site occurring between the pronnoter and polyadenylation site of the expression vector 
pME18S could be obtained. Its structure is shown in Rg. 2. 

45 EXAMPLE 8 

(Obtaining of a cell line expressing tiie protein) 

The plasmid pME18S-HAl-ll constructed in Example 7 and containing the cDNA for the protein as inserted at the 
50 Xho l restriction enzyme cleavage site of tiie expression vector pME18S was recovered and purified from the recom- 
binant Escherichia coli by the metiiod of Maniatis et al. ("Molecular Cloning", Cold Spring Harbor Laboratory, pages 86- 
96 (1982)) and thus a large amount of the expression plasmid DNA for the protein was obtained. COS cells were trans- 
formed by transfection tiiereof with the expression plasmid DNA. Thus. COS cells were first cultured in eRDF medium 
containing 10% FBS (total bovine serum) in tissue culture dishes 9 cm in diameter until a semiconfluent condition. 
55 Then, the medium was removed from the dishes, and a DNA solution prepared as mentioned below was added drop- 
wise thereto as mentioned below. First, for each dish 9 cm in diameter, a solution was prepared in an Eppendorf cen- 
trifuge tube by adding thereto 300 ^il of 2 x NEBS solution (2 x HEBS solution: 1 .6% sodium chloride, 0.074% potassium 
chloride. 0.05% disodium hydrogen phosphate dodecahydrate. 0.2% dextrose. 1% HEPES (pH 7.05)) and 1 0 ng of the 
plasmid DNA and making the volume 570 ^1 with sterilized water. Then, while adding 30 ^il of 2.5 M calcium chloride 
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solution to the DNA solution, the tube contents were stirred vigorously using a vortex mixer for several seconds. The 
resulting mixture was allowed to stand at room temperature for 30 minutes, with occasional stirring at intervals of atwut 
10 minutes using a vortex mixer. Tlie thuSiDrepared DNA solution was laid on the cells mentioned above and the whole 
was allowed to stand at room tenperature for 30 minutes. Then. 9 ml of eRDF medium (Kyokuto Pharmaceutical) sup- 

5 plemented with 10% FBS was added to each dish and Incubation was performed at 37°C for 4 to 5 hours in the pres- 
ence of 5% CO2. Then, the medium was removed from each dish and the cells were washed with 5 ml of 1 x TBS++ 
solution (1 x TBS++ solution: 25 mM Tris-hydrochloride (pH 7.5). 140 mM sodium chloride. 5 mM potassium chloride, 
0.6 mM disodium hydrogen phosphate, 0.08 mM calcium chloride, 0.08 mM magnesium chloride). After removing the 
1 X TBS++ solution, the cells were covered with 5 ml/dish of 1 x HEBS solution containing 10% DMSO (dimethyl sulfox- 

10 ide) and allowed to stand at room temperature for 1 to 2 minutes. The supernatant was then removed. The cells were 
again washed with 5 ml of 1 x TBS++ solution, 10 ml of eRDF medium supplemented with 10% FBS was added to each 
dish and Incubation was performed at 37**C in the presence of 5% CO2. After the lapse of 48 hours, the medium was 
recovered. The supernatant recovered was 20-fold concentrated and assayed for inhibitory activity against HGF activa- 
tor in the same manner as in Example 4. The inhibitory activity was confirmed. 

15 WNIe the invention has been described in detail and with reference to specific embodiments thereof. H will be 
apparent to one skilled in the art thai various changes and modifications can be made therein without departing from 
the spirit and scope thereof. 
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SEQUENCE LISTING 
INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH I 15 amino acids 

(B) TYPEt amino acid 
(D) TOPOLOGY: linoar 

(ii) MOLECULE TYPE: peptldo 

(V) FRAGMENT TYPE? N-terminal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapians 

(B) STRAIN: MKN45 

(X} SEQUENCE DESCRIPTION: SEQ ID NO: 1 

Ala Asp Arg Glu Arg Ser lie His Asp Phe Xaa Leu Val Ser Lys 
15 10 15 

xaa: unkno%m amino acid 
INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH! 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: internal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(6) STRAIN: MKN45 
(x) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
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Lys Val Val Gly Arg Xaa Arg Ala Ser Met Pro Arg Trp Trp Tyr Asn 

5 10 15 

Val Thr Asp Gly Sor Xaa Gin Lou Ph« Val Tyr Gly Gly 
20 25 

xaat unlcnovm amino acid 
INFORMATION FOR SEQ ID NOt 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 26 amino acids 

(B) TYPES amino acid 
(0) TOPOLOGY I linear 

(ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: internal fragment 
(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 

(X) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

Ala Thr Val Thr Glu Asn Ala Thr Gly Asp Lou Ala Thr Ser Arg Asn 

15 10 15 

Ala Ala Asp Ser Ser Val Pro Ser Ala Pro 

20 25 
INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPEt cDNA to mRNA 

(iv) ANTISBMSE: no 

(v) ORIGIKAL SOURCES 

(A) ORGANISM 1 Homo sapims 

(B) STRAIN} MKK45 
(Ix) FEATURES! 

(A) NAME/KEY s coding eequence 

(B) LOCATION: 1 tO 759 

(C) IDENTIFICATION METHOD; by experiment 

(A) NAME/KEY I signal peptide 

(B) LOCATION: 1 to 81 

(C) IDENTIFICATION METHOD: by experiment 

(A) NAME/KEY: mature peptide 

(B) LOCATION: 82 to 759 

(C) IDENTIFICATION METHOD: by experiment 
(X) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

ATG GCG CAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG 48 
Hmt Ala Gin Leu Cyo Gly Leu Arg Arg Ser Arg Aim Phe Leu Ala Leu 

15 10 13 

CTG GGA TOG CTG CTC CTC TCT GCC GTC CTG GCG GCC GAC CGA GAA CCC 96 
Lou Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg 

20 23 30 

AGC ATC CAC GAC TTC TGC CTG GIG TCG AAG GTG GTC CGC AGA TGC CGG 1A4 
Ser He His Asp Phe Cy« Leu Val Ser Ly8 Vtl Val Gly Arg Cy> Arg 
35 ^0 45 
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15 



20 



25 



30 



35 



40 



GCC TCC 
Ala Ser 
50 

CTC TTT 
Leu Fhe 

65 
AA6 GAG 
Lyi Glu 

GGT GAC 
Gly Aop 

GOT CCC 
Ala Fro 

TAT GAA 
Tyr Glu 
ISO 
TCC TTC 
Ser Fhe 

TTC ATC 
Phe He 



ATG CCT AGG TGG 
Met Pro Arg Trp 



CTG TAT 
Val Tyr 

GAG TGC 
Glu Cys 

CTG GCC 
Leu Ala 
100 
AGA AGG 
Arg Arg 
115 

GAA TAG 
Glu Tyr 



GGG GGC 
Gly Gly 
70 

CTC AAG 
Leu Lyfl 

85 
ACC AGC 
Thr Ser 

CAG GAT 
Gin Asp 

TGC ACC 
Cyfi Thr 



50 



CCA CGC TGG TAG 
Pro Arg Trp Tyr 
150 

TAT GGA GGC TCC 
Tyr Gly Gly Cys 
165 



TGG TAG AAT GTC ACT GAC GGA 
Trp Tyr Aen Val Thr Aep Gly 

55 60 
TGT GAC GGA AAC AGC AAT AAT 
Cye Asp Gly Atn Ser Afn Am 
75 

AAA TGT GCC ACT GTC ACA GAG 
Lys Cyt Ala Thr Val Thr Glu 
90 

AGG AAT GCA GCG GAT TCC TCT 
Arg Am Ala Ala Asp Ser Ser 
105 

TCT GAA GAC CAC TCC AGC GAT 
Ser Glu Atp His Ser Ser Asp 
120 125 
GCC AAC GCA CTC ACT GGG CCT 
Ala Am Ala Val Thr Gly Pro 
135 140 
TTT GAC GTG GAG AGC AAC TCC 
Phe Asp Val Glu Arg Asn Ser 
155 

CGG GGC AAT AAG AAC AGC TAG 
Arg Gly Asn Lys Asn Ser Tyr 
170 



TCC TGC CAG 
Ser Cys Gin 

TAG CTG ACC 
Tyr Leu Thr 
80 

AAT GCC ACG 
Asn Ala Thr 
95 

GTC CCA AGT 
Val Pro Ser 
110 

ATG TTC AAC 
Met Phe Asn 

TGC CGT GCA 
Cys Arg Ala 

TGC AAT AAC 
Cys Asn Asn 
160 

CGC TCT GAG 
Arg Ser Glu 
175 



192 



240 



288 



336 



384 



432 



680 



526 



55 
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15 



CAG CCC TCC ATC CTC CCC TGC TTC CGC CAC CAC CAC AAT CCT CCC CTG 576 
91u Ala Cj9 Met Leu Atg C/i Ar% Gin Gin Glu Asn Pro Fro Leu 

lao 185 190 

CCC CTT CCC TCA AAG GTC- GTC CTT CTC CCG CGC CTC TTC CTG ATC CTG 624 
Pro Leu Cly ficr Lye Val Val Val Leu Ala Gly L»u Phe Val Met Val 

193 200 205 

TTC ATC CTC TTC CTC CCA CCC TCC ATC GTC TAC CTC ATC CCC CTG CCA 672 
Leu lie Leu The Leu Gly Ala 5er Met Val Tyr Leu He Arg Val Ala 

210 215 220 

CCC ACC AAC CAG CAC CGT CCC CTG CGC ACC GTC TCC ACC TCC CCA CAT 720 
Arg Arg Aan Gin Glu Ar& Ala Leu Arg Thr Val Trp Ser Ser Gly Asp 
225 230 235 240 

CAC AAC CAC CAC CTC GTC AAC AAC ACA TAT CTC CTG TCA 760 
Alp Lya Glu Gin Leu Val Lye A»n Thr Tyr Vol Leu * 

2<i5 250 
INFORMATION FOR SEQ ID NOt 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LLTiGTH: 20 nucleotides 

(B) TVPEi nuclQic acid 

(C) STRANOEDNESS: 8 ingle stranded 
40 (D) TOPOLOGY: linear 

MOLECULE TYPEt other nucloic acid, synthetic DNA 
(x) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

^ AAGGTNGTNG GNWGNTGYMG 

Nt T ar G, M: A or C, Y: T or And Rt A or G 

50 



35 



55 
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INFORMATION FOR SEQ ID NOt 6 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 21 nucleotides 

(B) TYPES nucleic acid 

(C) STRANDEDNESS 1 Single Stranded 
(0) TOPOLOGY I linear 

(ii) MOLECULE TYPE: other nucleic acid, synthetic DNA 
(X) SEQUENCE DESCRIPTION! SEQ ID NOs 6 

CNCCGTANAC GAANARYTGR C 

N: T or Gf Mt A or C, Y: T or C, and Rt A or G 
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SEQUENCE LISTING 



10 



(1) GENERAL INFORMATION: 

(1) APPLICANT: 

(A) NAME: MITSUBISHI CHEMICAL CORPORATION 

(B) STREET: 5-2, Marunouchl 2-chome/ Chiyoda-ku 

(C) CITY: Tokyo 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP): none 

(il) TITLE OF INVENTION: NOVEL PROTEIN, DNA CODING FOR SAME 
AND METHOD OF PRODUCING THE PROTEIN 

15 (iii) NUMBER OF SEQUENCES: 7 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM; PC-DOS /MS-DOS 

20 (D) SOFTWARE: Patentin Release #1,0, Version #1,30 (EPO) 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 96 111 861,9 



25 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: N- terminal 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ala Asp Arg Glu Arg Ser lie His Asp Phe Xaa Leu Val Ser Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 £uaino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 
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(V) FRAGMENT TYPE: internal 

(vX) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



10 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Val Val Gly Arg Xaa Arg Ala Ser Met Pro Arg Trp Trp Tyr Asn 
15 10 15 

Val Thr Asp Gly Ser Xaa Gin Leu Phe Val Tyr Gly Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
^ (C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
25 (V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Thr Val Thr Glu Asn Ala Thr Gly Asp Leu Ala Thr Ser Arg Asn 
15 10 15 

Ala Ala Asp Ser Ser Val Pro Ser Ala Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



55 
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<ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION;!. ,756 

(C) IDENTIFICATION METHOD: experimental 

* (D) OTHER INFORMATION :/evldence= EXPERIMENTAL 

(ix) FEATURE: 

(A) NAME/KEY: sig_j>€ptide 

(B) LOCATION:!. ,81 

(C) IDENTIFICATION METHOD: experimental 

^0 (D) OTHER INFORMATION: /evidence* EXPERIMENTAL 

(ix) FEATURE: 

(A) NAME/KEY: mat_j>eptide 

(B) LOCATION: 82. .756 

(C) IDENTIFICATION METHOD: experimental 

15 (D) OTHER INFORMATION: /evidence= EXPERIMENTAL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG GCG GAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG 48 
20 Met Ala Gin Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu 

-27 -25 -20 -15 

CTG GGA TCG CTG CTC CTC TCT GGG GTC CTG GCG GCC GAC CGA GAA CGC 96 
Leu Glv Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg 
-10 -5 1 5 

AGC ATC CAC GAC TTC TGC CTG GTG TCG AAG GTG GTG GGC AGA TGC CGG 144 
Ser He His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg 
10 15 20 

GCC TCC ATG CCT AGG TGG TGG TAC AAT GTC ACT GAC GGA TCC TGC CAG 192 
30 Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gin 

25 30 35 

CTG TTT GTG TAT GGG GGC TGT GAC GGA AAC AGC AAT AAT TAC CTG ACC 240 
Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr 
40 45 50 

AAG GAG GAG TGC CTC AAG AAA TGT GCC ACT GTC ACA GAG AAT GCC ACG 288 
Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr 
55 60 65 

GGT GAC CTG GCC ACC AGC AGG AAT GCA GCG GAT TCC TCT GTC CCA AGT 336 
40 Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser 

70 75 80 85 

GCT CCC AGA AGG CAG GAT TCT GAA GAC CAC TCC AGC GAT ATG TTC AAC 384 
Ala Pro Arg Arg Gin Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn 
90 95 100 

^ TAT GAA GAA TAC TGC ACC GCC AAC GCA GTC ACT GGG CCT TGC CGT GCA 432 

Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala 
105 110 115 



18 



EP 0 758 682 A2 



TCC TTC CCA CGC TGG TAG TTT GAC GTG GAG AGG AAC TCC TGC AAT AAC 480 
Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn 
120 125 130 

^ TTC ATC TAT GGA GGC TGC CGG GGC AAT AAG AAC AGC TAC CGC TCT GAG 528 

Phe lie Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu 
135 140 145 

GAG GCC TGC ATG CTC CGC TGC TTC CGC CAG CAG GAG AAT CCT CCC CTG 576 
Glu Ala Cys Met Leu Arg Cys Phe Arg Gin Gin Glu Asn Pro Pro Leu 
150 155 160 165 

CCC CTT GGC TCA AAG GTG GTG GTT CTG GCG GGG CTG TTC GTG ATG GTG 624 
Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val 
170 175 180 

TTG ATC CTC TTC CTG GGA GCC TCC ATG GTC TAC CTG ATC CGG GTG GCA 672 
Leu lie Leu Phe Leu Gly Ala Ser Met Val Tyr Leu lie Arg Val Ala 
185 190 195 



20 



25 



CGG AGG AAC CAG GAG CGT GCC CTG CGC ACC GTC TGG AGC TCC GGA GAT 720 
Arg Arg Asn Gin Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp 
200 205 210 

GAC AAG GAG CAG CTG GTG AAG AAC ACA TAT GTC CTG TGA 759 
ASP Lys Glu Gin Leu Val Lys Asn Thr Tyr Val Leu 
215 220 225 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Gin Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu 
-27 -25 -20 -15 

Leu Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg 
-10 -5 1 5 

40 Ser He His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg 

10 15 20 

Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gin 
25 30 35 

45 Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr 
40 45 50 

Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr 
55 60 65 

50 



35 



55 
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Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser 
70 75 80 85 

Ala Pro Arg Arg Gin Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn 
90 95 100 

Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala 
105 110 115 

Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn 
120 125 130 

Phe lie Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu 
135 140 145 

Glu Ala Cys Met Leu Arg Cys Phe Arg Gin Gin Glu Asn Pro Pro Leu 
150 155 160 165 

Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val 
170 175 180 

Leu lie t^u Phe Leu Gly Ala Ser Met Val Tyr Leu lie Arg Val Ala 
185 190 195 

Arg Arg Asn Gin Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp 
200 205 210 

25 Asp Lys Glu Gin Leu Val Lys Asn Thr Tyr Val Leu 

215 220 225 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
35 (A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION :group( 6, 9, 12, 15) 

40 (D) OTHER INFORMATION: /note= "N is T or G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AAGGTNGTNG GNMGNTGYMG 

45 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A) NAME /KEY: misc^feature 

(B) LOCATION :group( 2, 8, 14) 

(D) OTHER INFORMATION :/note= "N is T or G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CNCCGTANAC GAANARYTGR C 21 



Claims 

20 1 , A protein having the following physico-chemical properties: 

(1) a molecular weight of about 30.000 daftons as determined by SDS-polyacrylamide gel electrophoresis: 

(2) inhibitory activity on the protease activity of hepatocyte growth factor activator; and 

(3) one of the amino add sequences depicted in the sequence listing under SEQ ID NO:1 through 3 or an 
25 amino acid sequence substantially equivalent thereto. 

2. A protein having one of the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3 
or an amino acid sequence sul>stantially equivalent thereto and having inhibitory activity on the protease activity of 
hepatocyte growth factor activator. 

30 

3. A protein having the amino acid sequence depicted in the sequence listing under SEQ ID NO:4 or an amino acid 
sequence substantially equivalent thereto. 

4. A protein having, as its amino acid sequence, that segment of the amino acid sequence depicted in the sequence 
35 listing under SEQ ID NO;4 which starts with the 28th amino acid (alanine) residue and ends with the 252nd amino 

acid (leucine) residue, or an amino acid sequence substantially equivalent thereto. 

5. A DNA coding for the protein of Claim 1 . 
40 6. A DNA coding for the protein of Claim 2. 

7. A DNA coding for the protein of Claim 3. 

8. A DNA as claimed in Claim 7 which is represented by the base sequence depicted in the sequence listing under 
45 SEQ ID N0:4 or a base sequence substantially equivalent thereto. 

9. A DNA coding for the protein of Claim 4. 

10. A DNA as claimed in Claim 9 which is represented by that segment of the base sequence depicted in the sequence 
50 listing under SEQ ID NO:4 which starts wrth the 82nd nucleotide (guanine) and ends with the 759th nucleotide 

(adenine), or a base sequence substantially equivalent thereto. 

1 1 . A gene coding for the protein of Claim 1 . 
55 12. A gene coding for the protein of Claim 2. 

13. A gene coding for the protein of Claim 3. 

14. A gene as claimed in Claim 13 which is represented by the base sequence depicted in the sequence listing under 
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SEQ ID N0:4 or a base sequence substantially equivalent thereto. 
15. A gene coding for the protein of Claim 4. 

5 16. A gene as claimed in Claim 15 which is represented by that segment of the base sequence depicted in the 
sequence listing urxjer SEQ ID NO:4 which starts with the 82nd nucleotide (guanine) and ends with the 759th 
nucleotide (adenine), or a base sequence substantially equivalent thereto. 

17. An expression vector which comprises the DNA or gene of one of Claims 5 through 16. 

10 

18. A transformant as obtained by transformation of a host cell with the expression vector of Claim 1 7. 

19. A transformant as claimed in Claim 18. wherein the host cell is an animal cell. 

15 20. A method of producing a protein having inhibitory activity on the protease activity of hepatocyte growth factor acti- 
vator which conrprises cultivating the transformant of Claim 18 or 19 to thereby produce the protein having inhibi- 
tory activity on the protease activity of hepatocyte growth factor activator. 

21. A method of protein production as claimed in Claim 20, wherein the protein is a protein defined in one of Claims 1 
20 through 4. 
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Fig. 2 
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(54) Hepatocyte growth factor activator inhibitor 
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tor, for transforming host cells. Cultivation of the trans- 
formant gave the desired protein. The protein can be 
used as an in vivo or in vitro control factor for HGF or 
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kinetic studies of the protein, or as a starxiard in assay 
systems therefor 



CO 
< 

CM 
CO 
CD 

CO 
LO 

o 

Q. 
UJ 



Printed by Bank Xerox (UK) Business Services 
2.14.20/3.4 



EP 0 758 682 A3 



J) 



European Patent 
Office 



EUROPEAN SEARCH REPORT 



AppUcmtloa Nmmbcr 

EP 96 11 1861 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Ck«ti«a of document mth indicadoa, wbere appropriate. 
of rc4cv«ot pa»aec« 



Raevaot 

to i 



CLASSinCATION OF THE 
APTUCATION aat.CL6) 



P,A 



EMBL/GENBANK DATABASES Accession no R35464 
Sequence reference HS46499, May 4, 1995 
L.HILLIER ET AL: "The WashU-Merck EST 
Project" 
XPOO2039653 

* the whole document * 

EMBL/GENBANK DATABASES Accession no N39798 
Sequence reference HS798277, January 26, 
1996 

L. HILLIER ET AL: "The WasU-Merck EST 

Project" 

XP002039654 

* the whole document * 

EMBL/GENBANK DATABASES Accession no R74593 
Sequence reference HS593137,June 9, 1995 
L.HILLIER ET AL: "The WashU-Merck EST 
Project" 
XP002039655 

* the whole document * 

JOURNAL OF BIOLOGICAL CHEMISTRY, 

vol. 268. no. 14, 15 May 1993, MD US. 

pages 10024-10028, XP002039578 

K. MIYAZAWA ET AL: "Molecular cloning and 

sequence analysis of the cDNA for a human 

serine protease responsible for activation 

of hepatocyte growth factor" 



1-21 



C12N15/15 
C07K14/81 
C12N5/10 



1-21 



1-21 



TiXHNICAL FIELDS 
SEARCHED (Int. 0,6) 



C07K 



JOURNAL OF BIOLOGICAL CHEMISTRY, 
vol. 271. no. 7, 16 February 1996, MD 
pages 3615-3618. XP002039579 
K- MIYAZAWA ET AL: "Activation of 
hepatocyte growth factor in the injured 
tissues is mediated by hepatocyte growth 
factor activator" 



US, 



The present search report has been drawn up for aU daims 



THE HAGUE 



4 September 1997 



Van der Schaal, C 



CATEGORY OF OTED DOCUMENTS 

X : parti culariy rdcvmal If taken alooc 

Y : pnttculvly rdevaat tf camblacd with another 

docnmettt of the same category 
A : technoLoglcal backgronnd 
O : non-written disclosure 
P : iDtcnaedtatc docnnieat 



T : theoiy or prindpte itnderiylae the invention 
E : earlier patent tfocuBcat, but published on, or 

after the filing date 
D : document dted in the application 
L : docwaoil dted for other reasons 

A : member of the sane patent fomity, corresponding 
docuncnt 



